Cosmological constraints combining H(z), CMB shift and SNIa observational data 
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Recently H(z) data obtained from differential ages of galaxies have been proposed as a new geo- 
metrical probe of dark energy. In this paper we use those data, combined with other background tests 
(CMB shift and SNIa data), to constrain a set of general relativistic dark energy models together 
with some other models motivated by extra dimensions. Our analysis rests mostly on Bayesian 
statistics, and we conclude that LCDM is at least substantially favoured, and that extradimensional 
models are less favoured than general relativistic ones. 



I. INTRODUCTION 

The almost undisrupted flow of data potentially useful 
for cosmological model selection and parameter fitting 
provides day by day a clearer picture of the evolution 
of the Universe, which according to a wide consensus 
seems to be currently accelerating. Yet it is uncertain 
how strong the conclusions of these sorts of investiga- 
tion are, as there are not only statistical uncertainties to 
worry about, but possible theoretical biasing in the tests 
used. 

This being the situation, the community is eager to be 
able to devise new tests and/or improve the understand- 
ing of the shortcomings of the existing ones towards re- 
fining them. The underlying hope is that, at least from 
a phenomenological perspective, cosmologists will even- 
tually be able to tell with high precision how fast the 
Universe is expanding at present, how long this speed up 
has lasted, and how the acceleration rate has changed 
over the recent past. 

Recently Simon et al. [l| have published Hubble func- 
tion H(z) data extracted from differential ages of pas- 
sively evolving galaxies. The use of these data to con- 
strain the background evolution of the Universe is inter- 
esting for several reasons. First, it can be used together 
with other cosmological tests in order to get useful con- 
sistency checks or tighter constraints on models. Second, 
in contrast to standard candle luminosity distances or 
standard ruler angular diameter distances, the Hubble 
function is not integrated over. This will allow, mostly 
in the future, when the systematic errors associated with 
the cluster physics have been reduced, to get cleaner con- 
straints on the cosmological parameters. The same data 
have also been recently used in 0, H[ to constrain several 
cosmological models. In this paper we use them in com- 
bination with the latest SNe la set compiled by Davis 
et al. [H and the CMB shift parameter as calculated by 
Wang and Mukherjee [fj from WMAP 3 [fj|. In contrast 
to other works, we have chosen not to use the baryon 
acoustic oscillations (BAO) data because, as pointed out 
in 0], there is a level of uncertainty in the use of the 
measure A given in 0] to test non-LCDM models. 

We have chosen to put constraints on 6 different cos- 



mological models that show late-time acceleration (see 
e.g. [3, [l(J El)- For all of them we have assumed flat 
space, consistent with the inflationary prediction that the 
curvature density parameter ~ [12j | and with the re- 
sults obtained by CMB experiments [6l [l3| . 



Three of the models are dark energy models while in 
the remaining three models the acceleration arises from 
or is modified by a five-dimensional modification of grav- 
ity motivated by extra dimensional physics. One also can 
group these models by pairs according to the number of 
parameters apart from Hq. LCDM and DGP 0, 
have only one parameter, i.e. the matter density param- 
eter. QCDM, i.e. dark energy parametrized as a perfect 
fluid with a constant equation of state, and LDGP, a non 
self-accelerating solution for the DGP action [l||, add to 
the matter density one further parameter: the equation 
of state and the crossover scale (i.e. the scale at which the 
5-dimcnsional behaviour becomes effective), respectively. 
The last two models chosen are the Chevalier-Polarski- 
Lindcr parametrization (TtI Il8j (which contains two pa- 
rameters characterizing the evolution of the equation of 
state) and the QDGP model (l9| . a phenomenological 
extension of the LDGP model, where the cosmological 
constant is replaced by a dark energy fluid with a con- 
stant equation of state. Similar analyses of a subset of 
these models using recent datasets partially overlapping 
with ours have been done in [2(| [U, [22j and our work 
may be complementary to that. 



The plan of the paper is as follows. In Sec. [IT] we out- 
line the models to be investigated and the correspond- 
ing forms of H(z). Then, in Sec. IIIII we give details of 
the three datasets considered, and we explain how they 
can be used to constrain the background of cosmological 
models. After that, in Sec. IIVI we review the concepts 
and techniques we resort to in our statistical analysis of 
the data. Finally, we present our results and conclusions 
in Sec. [V] (accompanied by some additional statistical 
information and considerations about the effective equa- 
tions of state in the different models in the Appendices). 



2 



II. MODELS AND PARAMETRIZATIONS 

As discussed in detail in [23j . extracting information 
about the properties of dark energy from a reconstruction 
of H(z) is not in principle problematic from the math- 
ematical perspective, but it is hard in practice because 
one only has at hand a discrete set of noisy data (for 
several redshifts) from which one wishes to infer informa- 
tion valid for all rcdshift values within a given interval. 
The various approaches to confronting models with data 
are based on deriving the luminosity distance Dl( z) [24j |. 

w de (z) [U EE US E| or h ( z ) [H M, M, H3, m ■ 

Here we will use the second approach and compare 
to observations six cosmological models altogether: the 
models can be grouped in pairs according to the number 
of parameters (one, two or three); in addition each pair 
will contain one general relativistic dark energy model 
and one model inspired by extradimcnsional modifica- 
tions to gravity. 



A. LCDM 

In general, if we consider a general relativistic model 
of dark energy with an equation of state that depends on 
the redshift, Wd e (z), the expression for H 2 (z) obeys 



H 2 (z) 
H 2 Q 



+ (1 - £l m ) exp 



1 + w de {x) 



1 



fix 



, (1) 



where one assumes that in addition to dark energy the 
Universe contains dust (dark matter and baryons). 

The LCDM model corresponds to the choice Wd e {z) = 
— 1, i.e. dark energy is a cosmological constant A. Thus 



H 2 (z) 
Hi 



i - a 



(2) 



and the free parameter of the model is £1 TO . LCDM is 
consistent with all data, but there is a theoretical prob- 
lem in explaining the observed value of A. 



B. DGP 

The dark gravity model inspired by the Dvali- 
Gabadadze-Porrati (DGP) braneoworld model was 
given by Deffayet [la ], and it represents a simple alter- 
native to the standard LCDM cosmology, with the same 
number of parameters. 

In this model the late Universe self-accelerates, not 
because of dark energy, which is absent, but rather due 
to an infrared modification of gravity. Explicitly one has 



H(z) 
Ho 



1 - fir. 
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+ n m (i + z) 3 . (3) 



One can define an effective dark energy equation of 
state w e e by imposing H 2 = i?gf2 m (l + z) 3 + 8TrGp e g/3 
along with p c g + 3H (l + w c g)p c g = to obtain, implicitly 
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where E(z) = H(z)/Hq, and explicitly 
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, , n m - 1 - v/(i - n m y + 4n m (i + ip 

w c g(z) — . . (5) 

2 v /(i-a,o 2 + 4a„(i + ^) 3 

For this scenario one can see that lim z ^_i w e g = —1, so 
the final asymptotic state is a de Sitter model. 



C. QCDM 

This model arises from the simplest generalization of 
LCDM, which consists in taking a constant value of 
Wde — w different, in general, from —1, so one gets 



H 2 {z) 



n m (i + zf + (i - n m )(i + z) 3( - 1+w \ (6) 



Despite its simplicity, this model can prove useful, as a 
preference for this parametrization over the LCDM case 
will provide support for the evolutionary nature of dark 
energy. 



D. LDGP 

This model represents the non self-accelerating branch 
of DGP (the two separate branches of DGP arise from 
the two possible ways to embed the 4D brane universe in 
the 5D spacetime; see also [TH, [29| for further explana- 
tion) . To generate acceleration a cosmological constant is 
needed. There are two main features of this model. One 
is the screening effect on A due to the presence of the 
extra dimension, that allows for a higher value of the cos- 
mological constant. The other is the possibility of having 
effective phantom behaviour without any phantom field 
and any of the associated instabilities. In addition, this 
model does not have ghosts, unlike the self-accelerating 
DGP. 

The evolution of the model is governed by the relation 



H, 



(7) 



where on the one hand SI a = 1 + 2y / F^T — Q m for the 
requirement of flatness (see 29] ) , and on the other hand 
the parameter O rc is related to the crossover scale which 
signals the transition from the general relativistic to the 
modified gravity regime (see again [l6| for details). Ob- 
servational constraints on this particular model using SN, 
CMB shift and BAO data were studied in I29|, where it 
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was found that statistically the best fit corresponded to 
the LCDM limit of the model. 

The same procedure to obtain w e g that we mentioned 
in the DGP model applies here, so one will have, from 

Eq. dm, 



w e g(z) 



E(z)) 



(8) 



E. Chevallier-Polarski-Linder Ansatz 

This is a widely used Ansatz, which generalizes QCDM 
to allow for evolution in the dark energy equation of state, 
as required by most realistic scalar field models. (It was 
first discussed in (l7j , and reintroduced independently in 

00 

Explicitly it corresponds to Wd c {z) = wq + wi(l — (1 + 
z)^ 1 ) and therefore 



H 2 (z) 
Hi 



n m (i + z) 3 

+ (1 - Q m )(l + zf (1+Wo+w ^e' 3 ^. (9) 



Among its desirable features two stand out: first, this 
parametrization of dark energy remains finite at large 
redshifts, and second, the physical interpretation of the 
model is simple as the parameter w\ is a measure of the 
scalar field potential slow roll factor V' /V in the case of 
quintessence [l8j . 



F. QDGP 

This model, introduced in [19], is a generalization of 
LDGP where the cosmological constant is replaced by 
dark energy with a constant equation of state w, in gen- 
eral different from — 1, that represents a new parameter 
of the model. The modified 4D Friedman equation has 
the following form: 



Hjz) 



= y/n m (i + zf + n w (z) + n rc - ^/aZ (10) 



where Q w (z) = (1 + 2^0~- fi m )(l + z) 3 ( 1+w \ Clearly, 
in this case too one can make use of Eq. (Q| to derive 
w c s- 



W c ff 



(1 + w)ft w (z)E(z) - y^l~n m (l + z) 3 
M^\\OJz) - 2£(z) v /0~T 



(11) 



III. OBSERVATIONAL TESTS 

A. Hubble parameter observations 

The Hubble parameter depends on the differential age 
of the Universe in terms of redshift, specifically 

< 12) 
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Thus, measuring dt/dz allows us to determine H(z). As 
described first in [3(| and in p| [3l[, it is possible to 
use absolute ages of passively evolving galaxies to com- 
pute values of dt/dz. The galaxy spectral data used by 
[l[ come from the Gemini Deep Deep Survey [32j and 
archival data (33|. In broad terms, the authors of these 
references bin together galaxies with a redshift separa- 
tion which is small enough so that the galaxies in the bin 
have roughly the same age; then, they calculate age dif- 
ferences between bins which have a small age difference 
which is at the same time larger than the error in the age 
itself [l[ . The outcome of this process is a set of 9 values 
of the Hubble parameter versus redshift (see Table U]). A 
particularly nice feature of this test is that differential 
ages are less sensitive to systematic errors than absolute 
ages [IH. 



TABLE I: H(z) data from [J (in units of kmr^pc" 1 ) 



z 


H(z) 


a 


0.09 


69 


12 


0.17 


83 


8.3 


0.27 


70 


14 


0.4 


87 


17.4 


0.88 


117 


23.4 


1.3 


168 


13.4 


1.43 


177 


14.2 


1.53 


140 


14 


1.75 


202 


40.4 



Observed values of H{z) can be used to place con- 
straints on different models of the expansion history of 
the Universe by minimizing the quantity 



xl{H a ,{6 i \) = 



9 

E 



{Hizj-je^-H^z^f 



(13) 



This test has already been used to constrain several 
cosmological models in 0, Q, and it does not seem to 
provide tight constraints on its own, so it seems neces- 
sary to combine it with other tests. Now, since one of 
the other tests we are using (the CMB shift) does not 
constrain H$ we will regard this parameter as a nuisance 
one, and so it will be more convenient to marginalize over 
it (see for instance 35]) and work with the quantity 



*h(W) = 



"21og(7 Tz H {H Q )e-^ H "^dH^j .(14) 
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The quantity tth(Hq) is a so called prior probability 
function (see section IIV|) which reflects some previous 
knowledge about preferred values of Hq. We have in- 
dependent evidence from the Chandra X-ray observa- 
tory giving H = 77 ± 4 kms^Mpc" 1 36 . This re- 
sult is consistent with the 3 year WMAP dataset result 
H = 73±3 kms^Mpc" 1 and with the Hubble Space 
Telescope key project result Hq = 73 ± 8 kms _1 Mpc _1 
[37l | . Here we choose to use the Chandra X-ray result (for 
statistical independence with respect to the other tests 
to be used later) and use this result to place a Gaussian 
prior on Hq so that 



tth(Ho) 



2-kuho 



(15) 



where obviously Hq = 77 and au Q = 4 



B. The Davis et al. 2007 dataset 

This dataset is one of the latest supernovae catalogs to 
be completed. It consists of 192 SNe classified as type la 
up to a redshift of z = 1.755 The authors combine 
the set compiled by |38| and the 30 new SNe data at high 
redshift (0.216 < z < 1.755) recently discovered with the 
Hubble Space Telescope (HST) [ijj. The first of those 
two sets consists of the addition of three smaller datasets. 
The first subset is made of the SNe la data from the 
ESSENCE project [38(, a ground-based survey designed 
to detect about 200 SNe la in the range z = 0.2-0.8. The 
second is made of the Supernova Legacy Survey (SNLS) 
[39l ] data. These have been refitted by 38] with the same 
lightcurve fitter used for the ESSENCE data. The third 
subset corresponds to the nearby SNe already presented 
in 0, HO, El| as refitted and used by [38| . As regards the 
HST SNe, it has been necessary to perform a normal- 
ization since the refitting of the HST lightcurves with 
the fitter used for the ESSENCE data is still in progress. 
The normalization has been done using the low redshift 
sample common to both sets and the error in the nor- 
malization is included in the distance errors for the HST 
SNe. Summarizing, the Davis et al. dataset consists of 60 
ESSENCE supernovae, 57 SNLS supernovae, 45 nearby 
supernovae and 30 HST supernovae. It is available at 
http : //www . dark- cosmology . dk/archive/ SN , 
http: //braeburn.pha. jhu. edu/~ariess/R06 and 
http : //www. ctio .noao . edu/essence . 

The statistical best fits of theoretical forms of H (z) us- 
ing supernovae data rest on the definition for the distance 
modulus 



MthC?«) = 51og 10 (dL(z; {0i})) + ^ (16) 



where 



d L {z;{6i}) = (l + z) 



Hodz 



H(z;H ,{6i}) 



(17) 



is the dimensionless luminosity distance which is related 
to its dimensional counterpart through -D_l(z; i?o, {$i}) = 
cd L (z;{0i})/H o . 

The best fits are obtained by minimizing the quantity 
(see Sec. |TV} 

2 / ra\\ OfthCfjj MO, {Oi}) ~ Mobs^j)) 2 / -| o\ 



where the a^j are the measurement variances [4J. The 
nuisance parameter /iq encodes the Hubble parameter 
and the absolute magnitude M [42j and has to be 
marginalized over [35| . So one will actually be working 
with the quantity 

Xsn(W) = -2 log (| e-^^-^dfio) ■ (19) 

A frequently used alternative [43| , consists in minimizing 
the quantity 



XSN^})^-- 2 
C3 

with respect to the other parameters. Here 

(/i t h(zj,Mo = 0, {9i}) - Hobs(zj)) 2 



192 

3=1 

192 

C3 = 



Hth(zj, ^0 = 0, {9i}) - Hobs(zj) 



192 



(20) 



(21) 



(22) 



(23) 



It is trivial to see x| N is just a version of Xsn minimized 
with respect to To that end it suffices to notice that 



Xsn(M0, {Oi}) = Ci - 2c 2 M0 + C3Mo ; 



(24) 



which clearly becomes minimum for /io = C2/C3, and so 
we can see X§ N ({6>;}) = x|n(^o = 0, {#*}). Furthermore, 
one can check that the difference between Xsn an< ^ Xsn 
is negligible and therefore we will just use the first of 
them. 



C. The CMB shift 

The CMB shift R is arguably the parameter with least 
model-dependence among those which can be inferred 
from CMB data, provided that the dark energy density 
parameter is negligible at recombination, and it does not 
depend on Ho. It is directly proportional to the ratio of 
the locations of the first peak in the temperature angular 
power spectrum in the model one wants to characterize 
(if T ) and in a reference flat SCDM model (lf rT ), that is, 



R 




dz 



(25) 
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where z rec is the redshift of recombination (a factor of 
2 has deliberately been introduced so as to reconcile the 
different definitions of the shift parameter in the litera- 
ture). The approximate expression in the last equality of 
(|25[) is the definition given in [44j and when we turn to 
numerical tests we will take it as exact, i.e. we will ignore 
the fact that it is only approximately equal to other defi- 
nitions. In the following lines we will show how to obtain 
this approximate expression for R. In an arbitrary model 
one has 



Ts (^rcc) 



(26) 



where the last scattering sound horizon scale r s (z lec ) is 
given by 



^(^rec) Otri 



c s (a)da 
a 2 H(a) ' 



(27) 



with c s the sound speed in the model, and (under the as- 
sumption of flatness) the sound horizon angular diameter 
distance is in turn given by 



1 + z„ 



dz 

wry 



(28) 



Arguably, one can take c s (a) to be constant, and in ad- 
dition we will take a rcc and a cq to be model indepen- 
dent. Approximate expressions will be obtained using 
1 3> a rcc 3> a cq . For a fiducial matter-radiation model 
one gets 1 




(29) 



but if one makes the more stringent assumption that the 
fiducial model is a SCDM one (£l' m = 1) then from the 
latter it follows that 



r s ( z rcc) 



and 



£s_ 3/2 
Tj roc ! 



2c 



(30) 



(31) 



(32) 



Hq vpm 

so using the definition in Eq. (|26p combined with Eqs. 
(|28|30|31|32|) . one obtains Eq. (25]). Constraints on the 
parametrizations of H(z) using reported values of the 



On the other hand, and in an arbitrary model 

T s (^rec) 



CMB shift (based on observations) will be obtained from 
minimization of the quantity (see Sec. HVj) 



(R(z rec ; {9i}) — R bs(z ICC ))^ 



(33) 



We use the value R = 1.70 ± 0.03 of the CMB shift pa- 
rameter calculated by [5| from the WMAP 3 data [6J] (for 
the computed value z rcc = 1090). Recently some points 
regarding the use of the CMB shift parameter have been 
raised and discussed by [It], IH, [43] and this is an is- 
sue that deserves additional clarification. Furthermore, 
one must be cautious, when weighting the conclusions 
reached using the CMB shift as it extends the redshift 
integration out to very large redshift, so that arbitrary 
parametrizations called to be applicable for such large 
redshift range can be very sensitive to errors. 



IV. 



STATISTICS AND DATA ANALYSIS 



Parameter estimation in the context of a given (cosmo- 
logical) model depending on some parameters has two 
ultimate goals: one is the determination of the "most 
likely" values of the parameters to yield a series of avail- 
able observational data, the other is measuring our degree 
of confidence in the fact that those data where generated 
by values of those parameters lying in an estimated in- 
terval. A related task is to compare different models 
using information retrieved in the parameter estimation 
process. The estimators used in both these jobs (param- 
eter estimation and model comparison) are different in 
the two main approaches to statistics (frequentist and 
Bayesian). In the remainder of this section we give a 
short account of these topics. 



A. Parameter estimation 

By definition £({dj}\{9i}, Ai) represents the the un- 
normalized probability density function (aka likelihood) 
that one measures the data {dj} given the model M. 
is true and its parameters take values {9i} [50| . Even 
though we will try to keep the discussion in this section 
as general as possible, when we turn to analyze particular 
datasets we will assume as customary that the measure- 
ments are normally distributed around their true value 
so that 



£(H}|{0 i },M)oce^ 2 «^»/ 2 . 



(34) 



The probability density function p({8i}\{dj} , M.) of 
the parameters to have values {6i} under the assump- 
tion that the true model is M and provided that the 
available observational data are {dj} reads 
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1 We present our expression in a fashion more similar to that in 
Eq. (10) of Ref 1451 ] although it is completely equivalent to Eq. 



(18) in Ref. [H) 



J 



p({Bi}\{dj},M) 



J C({d j }\{9 i },MH{di},M)de 1 . 



(35) 



In the Bayesian framework p({9i}\{dj},A4) and and z is calculated from 
7r({#i}, M) are respectively called the posterior and prior 
probability density functions (pdf) [5(| HH, H3, [H, W\ . 
The prior pdf encodes all previous knowledge about 
the parameters before the observational data have been 
collected. It may be regarded as subjective up to a 
certain point, but its use is compulsory in the Bayesian 
approach, which is arguably the approach to be used for 
theoretical frameworks which do not admit repetition 
of experiments (we only have one universe to enquire 
about). 

Parameter estimation in the Bayesian framework is 
based on maximizing the posterior pdf p({8i}\{dj} , M), 
whereas in a "strict" frequentist approach one just max- 
imizes £({dj}\{8i} , Ai). When one uses flat priors in the 
Bayesian approach then the same conclusions are drawn 
from both approaches and then the difference turns to be 
conceptual only [5l|, [52], [H[ . Interestingly, this kind of 
prior, which is also called the top-hat prior, is the most 
popular one in usual practise, but this does not neces- 
sarily mean it gives a fair representation of the state of 
knowledge before the experiment is carried out, and this 
usually requires physical insight into the problem (some- 
times priors rest upon symmetry considerations) [531 ] 2 . 

The second step toward constraining parameters sat- 
isfactorily is to construct credible intervals [H, [55| . We 
will for a while simplify our notation and let 



p(0i)dBi = ((1 - 0.68)/2) x p(6i)dei. (40) 



P ({e i }\{d j },M) = p(d 1) ...,e n ). 



(36) 



In the Bayesian approach [54( the 68% credible inter- 



vals on the parameter 9i will be given as 9i = x_ y with 
x, y, z calculated as follows. Here x will be the median of 
the marginal probability density function 



p(0i) = J p(6i, . . . , e n )d6 x . . . ddi^dOi+x ...d6 n . (37) 
The median x is calculated from 

p(8i)ddi = 0.5 x / p(di)d8i . (38) 



Similarly, y is calculated from 

px — y 

P (9 l )d9 l = ((1 - 0.68)/2) x / p(9i)d9i , (39) 



2 Some people work in a "mixed" approach and consider priors but 
then follow frequentist procedures. 



For marginalized normalized likelihood functions (i.e. 
posteriors) and likelihood contours see Figs. [TlfTUl The 
parameters 9a and 9i U entering the definitions of y and z 
are respectively a lower and an upper bound which should 
either be given by some physical restriction, or alterna- 
tively be chosen so that the conditions p(9a) « and 
p(9iu) ~ are satisfied to the desired degree of accuracy. 
Hence, the probability to get the observed values of the 
different physical quantities into play is 0.68 if 9i lies in 
the range [x — y, x + z]. The modification required so as 
to calculate the 95% credible interval is straightforward. 

In cases with strongly asymmetric posteriors, one will 
have to resort to alternative ways to report credible inter- 
vals [5|| . This will be the case in two of the models under 
study in this paper, and details will be given below. 

In order to calculate the confidence intervals (corre- 
spondent of the credible intervals of Bayesian statistics) 
in the frequentist approach one has to calculate the 
mode of C(9i), which is the value of 9i which maximizes 
C{9i). We will denote this value as 6*i m d The boundaries 
of the 68% confidence interval are calculated by finding 
the two values of 9i for which £(#;) = C(9 iui< \)e^ 1 / 2 [55[, 
whereas the 95% confidence interval is obtained by the 
same recipe, except that the exponent —1/2 must be 
replaced by —2. We will not use this way of calculating 
confidence intervals because it is exact only in gaussian 
situations (see Eq. (|3~4"|) ), but we will rather do it the 
Bayesian way. 

In two of our models (specifically LDGP and QDGP) 
it happens that, due to nonphysicality reasons, the al- 
lowed region of one of the parameters (f£ rc ) is truncated 
by a theoretical cut. This sharp restriction demands a 
modification of the definition of credible intervals. Let 
us denote this parameter as 9+ for simplicity (and consis- 
tency with the notation in this section). The truncation 
is such that 9+i = #* m d, so the 68% credible interval on 
9+ will be reported as 9+ = x +z , where x will now denote 
<?*md and z will be calculated as 

p(0*)d0* = 0.68 X Jp(9+)d9+. (41) 

Table HIl summarizes results. 
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B. Model comparison 

A popular but not too refined way to rate goodness 
of models in the frequentist approach is to compare 
values of the quantity — 21ogmax{£(#i, . . . , n )}/dof = 



max{x 2 (#i 



,0 n )} where dof stands for the num- 



ber of degrees of freedom of the model, which in 
turn is the number of observational data point mi- 
nus the number of parameters. Nevertheless, in gen- 
eral max{£(#i, . . . , 6 n )} ^ £(6q md , . . . , 6» mnd ); if we let 
max{£(6>i, . . . , 8 n )} = C(9ihi, ■ ■ • , 9 n bi) then the vector 
(6*ibf , . . . , 6*nbf) will represent what is usually referred to 
as the best fit. 

In the Bayes approach, the preferred estimator is the 
evidence [53]. It does not rely exclusively on the best- 
fitting parameters of the model, in contrast, it informs 
about how well the parameters of the model fit the data, 
after doing an averaging over all the parameter values 
that were theoretically plausible before the measurement 
ever took place [58| . 

We denote Bayes evidence as £ (M) = p({dj}\M), and 
it is defined as the probability of the data {dj} given the 
model A4, that is, 

£(M) = J ir({6i}, M)C({dj}\{0i}, M)d6 1 ...d6 n (42) 

where n({6i},M) is the model's prior on the 
set of parameters, normalized to unity (i.e. 
(/ n({6i}, M)d9\ . . . d9 n = 1.) The most common 
choice is the top hat prior, so that one rewrites Bayes 
evidence as 



£(M) 



(43) 



Now, to realize of the usefulness of the Bayes evidence 
towards our final goal, which is model selection, note 
that in Bayesian statistics the preference of model A4i 
over model A4j given the data {dk} is estimated through 
the quotient 



p(Mi\{d k }) _ £i{Mi) MMi 



p(M 3 \{d k }) £j{Mj) TTj{Mj)' 



(44) 



On the other hand, the Bayes factor B^ for any two 

£i{Mi) 



models Mi and Mj is defined as 



Bi 



(45) 



so if, as in usual practise, one assumes no prior preference 
of one model over the other, that is, TTi(M.i) = irj(M.j) = 
1/2, one finally has 



p(Mj\{d k }) 
p(M 3 \{d k }) 



(46) 



We report values of the evidences and Bayes factor for 
the models considered here in Table HVl (see Table HTT1 for 
priors). Values of the Bayes factor could be used to at- 
tribute evidence to the model Mi against the model Ab- 
using Jeffreys' scale [H^] which regards evidences of one 
model against the other as not significant, substantial, 
strong or decisive depending on values of In(Bij). Re- 
cently a the refinement of this scale has been proposed 
by Wasserman [6(| ■ These scales are of course subjective 
and one can question how illustrating they actually are, 
but the underlying philosophy is that the Bayes factor is 
a ratio of odds in the case there is no preference of one 
model over the other, so for the benefit of the readers 
which prefer a more objective report of our results we 
give, in Table |V] the ratio of odds of different pairs of 
models among those under consideration, which we will 
express as Oij = Bij : 1 in the cases where Bij > 1, 
whereas for the sake of aesthetics we will use the alter- 
native look Oij = 1 : Bji. These expressions then must 
be interpreted in the sense that an odds ratio equivalent 
to B^ : 1 means the model i is Bij times more probable 
than model j, whereas and odds ratio of the form 1 : Bji 
means model i is Bji times less probable than model j. 

Before we close this section, an important remark is in 
order. The usual situation in cosmology is that for the 
task of constraining the parameters {9i} one has at hand 
more than one set of statistically independent observa- 



tional data, say {d^}, ■ . ■ {d k m> }\ in that case, one can 
resort to the joint probability density function 



ptieMd^M) x ... xp({^}|{4 m )},x) 



n{d k m) },M) = 



(47) 



Clearly, the latter rule can be used to generalize conve- 
niently the whole discussion above to this situation with 
availability of more than one dataset. 



RESULTS AND CONCLUSIONS 



J 



In this paper we have performed parameter estimation 
and model selection for a set of flat FRW cosmological 
models which could be suitable to explain the observed 
current acceleration. Half of the models stem from the 
assumption of a dark energy component in the Universe, 



whereas the other half is made by models motivated by 
extra dimensions and that include a gravitational compo- 
nent of the observed expansion acceleration. We analyze 
data coming from (most recent) SNIa luminosity, CMB 
shift and H{z) measurements. Even though most of our 



8 



TABLE II: Best fits, % 2 and Credible Intervals 



Model 



Best-fit 



x/dof 68% conf. int. 



95% conf. int. 



LCDM fi m = 0.26 



DGP 



fim = 0.23 



QCDM fi m = 0.24 
w = -0.94 



1.03 
1.08 
1.03 



O _ f| ne+0.02 n — n 9R+ 05 
J — —0.02 ic, m — —0.04 



fin 



0.23 



+0.02 
0.02 



Q - n OC+0 03 



-0.96 



+0.08 
0.09 



0.23 



+ 0.04 
0.04 



O _ n oc+0.06 



-0.96 



+0.15 
0.20 



LDGP fi m = 0.26 
fir, = 0.00 



1.04 



0,; 



26+ 03 

: 0.00 +0 02 



"71 



26+ 05 

: 0.00 +0 ' 05 



QDGP 



CPL 



fi m = 0.25 
fir c = 0.00 
w = -0.95 

fi m = 0.25 
wo = -1.08 
wi = 0.58 



1.04 



1.04 



fim 


— n OQ+0.03 

— U.ZO_ Q3 


$^777, 


= 0.23; 


fir c 


= 0.00+ 088 




= 0.00 


It) = 


: -0.8lt°;° S 7 


= 


-0.81 


fim 


= 0.261°$ 


k2m 


= 0.26; 



too = -1-05I 020 



Wl 



■ T7+ - 70 

u -°' -0.84 



+ 0.00 
-0.05 
+2.32 

+0.09 
-0.19 



+0.06 
■0.05 



wo = -1.05±g;l^ 



TABLE III: Priors 





Model 


Parameter ranges 








LCDM 


0.00 < fi m < 0.60 








DGP 


0.00 < fi m < 0.60 








QCDM 


0.00 < fi m < 0.60 , -1.50 < to 


< -0.33 






LDGP 


0.00 < fi m < 0.60, 0.00 < fi rc 


< 0.10 






CPL 


0.00 < fi m < 0.60, -1.50 < w < -0.33, 


0.00 < wi < 2.00 






QDGP 


0.00 < fi m < 0.60, -1.50 < w < -0.33, 


0.00 < fir c < 1.00 








TABLE IV: Bayes evidence 






Model 


£ 


In(Bi.DGp) ln(Bi t QCDM) Zra(.Bi, ldgp) ln(Bi : cPh) 


ln(Bi,QnGp) 


LCDM 


1.05 x 10" 46 


5.55 1.54 2.31 


1.83 


3.56 


DGP 


4.07 x 10~ 49 


0.00 -4.01 -3.24 


-3.73 


-2.00 


QCDM 


2.25 x 10~ 47 


0.00 0.77 


0.29 


2.01 


LDGP 


1.04 x 10" 47 


0.00 


-0.49 


1.24 


CPL 


1.69 x 10" 47 




0.00 


1.72 


QDGP 


3.02 x 10" 48 






0.00 



statistical treatment fits into the Bayesian approach, for 
completeness we also report results on some frequentist 
estimators. Our results are consistent with partially over- 
lapping work (e.g. [20l [2l|) done previously using differ- 
ent combinations of observations. 

One may think a priori the dominant source of error to 
be the H{z) data, as the percentage errors range from 8% 



to 20%, whereas for the SN data the lowest percentage er- 
ror is 0.3% and the highest is 1% on the one hand, and for 
the CMB shift single datum we have a percentage error of 
1.7%. On the other hand, since the % 2 construction from 
the H(z) data does not involve any integration, there 
might be a restraint in the propagation of errors coming 
from them. We are inclined to think this is actually the 
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TABLE V: Odds ratio (number of times a model is more probable than other) 



Model 


Oi,DGP 


Oi.QCDM 


Oi.LDGP 


C»,CPL 


Ci.QDGP 


LCDM 


258:1 


5:1 


10:1 


6:1 


35:1 


DGP 


1:1 


1:55 


1:26 


1:42 


1:7 


QCDM 




1:1 


2:1 


1:1 


7:1 


LDGP 






1:1 


1:2 


3:1 


CPL 








1:1 


6:1 



QDGP 1:1 



case as the size of errors in our study are comparable 
with those obtained in other works. 

In the light of our results we conclude, first of all, that 
the frequentist estimator x 2 /dof significantly disfavours 
the DGP model over all the other models considered. We 
compute the probability of getting values of the \ 2 n °t 
farther from the minimum than the minimum x 2 for all 
other models, as defined in [25[ and we get a probability 
P > 99%. Consistently we arrive at the same conclusion 
using Bayes evidence. On the other hand, if we com- 
pare LCDM with all the other models, it turns out that 
from the frequentist perspective it is as good as QCDM, 
whereas for LDGP, CPL and QDGP one gets exactly the 
same value x 2 /dof, a value which turns out to be just 
slightly larger than the one for LCDM. In conclusion, the 
frequentist analysis yields the result that LCDM is just 
marginally preferred over all the rest of the models, ex- 
cept for DGP which is clearly the least likely model of all 
six. 

Regarding the Bayesian analysis, the reader can decide 
whether to qualify our results using the adjectives in Jef- 
frey's scale or just stick to the objectivity of the numbers 
giving us the odds of the different models against the oth- 
ers, but at least in the case of DGP versus LCDM we dare 



to say it is difficult not to be carried by the impression 
that it is rather disfavored. 

Finally, our results on credible limits point us in the di- 
rection that this combination of data sets constrain very 
poorly models with two "dark energy" parameters, so pa- 
rameter constraints on CPL and QDGP seem not to be 
tight enough. 
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APPENDIX A: PROBABILITY DENSITIES 



In this appendix we present the plots of the probability densities of the six different models considered. As explained 
above £g i will represent the probability density obtained by marginalization over all the parameters of the model except 
for the parameter 0j. For the models with more than one parameter we also plot the likelihood contours associated 
to 68, 95 and 99% probability. For the models with 3 parameters we have fixed O m to be 0.25. 
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FIG. 1: Likelihood function for the flat LCDM model. The light (dark) shaded region indicates the error on Q m to the left 
(right) of the median. 




-0.6 



-0.8 



w -1 
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-1.4 




0.1 0.2 0.3 0.4 0.5 

Or, 



FIG. 3: Likelihood contours corresponding to the 68, 95 and 99% credible contours in the (w, Q m ) plane for the QCDM model. 





0.5 0.2 0.25 0.3 0.35 0.4 



FIG. 5: Likelihood contours corresponding to the 68, 95 and 99% credible contours in the (O m , f2 rc ) plane for the LDGP 
model. 
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FIG. 7: Likelihood contours corresponding to the 68, 95 and 99% credible contours in the (wo, Wi) plane for the CPL Ansatz. 
Q m has been fixed to 0.25. 
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FIG. 9: Likelihood contours corresponding to the 68, 95 and 99% credible contours in the (O rc , w) plane for the QDGP model. 
Q m has been fixed to 0.25. 
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APPENDIX B: EQUATION OF STATE PARAMETERS 

For the sake of illustration, we represent the best fits Wde(z) for the CPL model and w g(z) for the DGP and LDGP 
models. We do not present the plots for the LCDM and QCDM models as the corresponding plots are basically 
stripes of constant width as they correspond to non-dynamical dark energy. For every best fit we include in the plot 
an estimation of error propagation based on the assumption of no error correlation. As in our parameter constraints, 
we have various sources of complication. On the one hand, as in some models (CPL, for instance) we have found slight 
departures from gaussianity, we cannot use the standard error propagation formula, in addition the errors found are 
large in the case of some parameters also we put forward a modification of the standard formula in order to account 
for non-gaussianities and large errors. Assuming our constraints in a generic cosmological parameter 9\ are reported in 
the form #mid^Ae"'' with A9u and A9i U being positive quantities. We can give an estimate of the error in a quantity 
/(#!, . . . , 9 n ) derived from various such parameters by taking for the upper error the quantity 



^(max(A/ m ,-A^)) 2 (Bl) 
t=i 

and for the lower error the quantity 



^(min(A/ ltl ,-A/,0) 2 , (B2) 
where 

A/i„ =/(... 0(j_l) m d, Oimd + A9 iu , %+i) m d, ...)—/(... %-i) m d, #imd, %+l)md, • ■ ■ ) (B3) 

and 

Afil = /(. . . 0(i-l)md> 0imd ~ A6u, 0(<+l) m d, • ■ ■ ) — /(• • ■ %-l)md, &imd, %+l)md> • • • )• (B4) 

So, we have proposed an estimation of error based on finite differences; nevertheless, one can be more refined if 
errors are small, i.e. A9i U = 59i U and A9a — 59u , because then one can write 



and 




Finally, in the gaussian situations (A9i U = A9u = A9i) (as for instance DGP) we get Af u — Afi. 

In Fig. (TTTj) we depict the best fit equation of state parameter along with error stripes for the models with more 
than one parameter. A few points are worth mentioning regarding the LDGP and QDGP models so as to clarify the 
peculiarity of the result. First of all, one must keep in mind for both cases we have found the best fit to correspond 
to their LCDM limit, which translates into an exactly constant equation of state parameter. On the other hand, the 
LDGP may look even stranger, as no upper error stripe appears in its plot. The reason for this is well grounded 
analytically, as the LDGP model does not have the ability to cross the phantom divide [li|, so w e g cannot be 
error-propagated across that barrier. Our plot just gives a g raphical account of that analytical imposibility. 
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FIG. 11: Equation of state parameter as a function of redshift for models CPL, DGP, LDGP and QDGP (top to bottom). 
The black lines correspond to the best fit for each of them, and the gray bands indicate the errors (calculated by the finite 
differences method). 
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